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Abstract. We discuss the use of Kaniadakis' At-exponential in the construction of a statistical manifold 
modelled on Lebesgue spaces of real random variables. Some algebraic features of the deformed exponential 
models are considered. A chart is defined for each strictly positive densities; every other strictly positive 
density in a suitable neighborhood of the reference probability is represented by the centered Iuk likelihood. 



1 Introduction 



G. Kaniadakis [HElElll], based on arguments from Sta- 
tistical Physics and Special Relativity, has defined the k- 
deformed exponential for each a; S M and — 1 < k < 1 to 
be 

■ (1) 



exp^ {x) = exp 



with special cases 



/ ^ I (kX + Vl + k'^x'^) , if K 7^ 0, /r,\ 

exp„ (x) = <^ V ^ ^ T n 

I exp X, UK — (J, 

and derivation formulae 

exp/(a;) = (1 + k^x^)-'/^ exp, (x) > (3) 

■ exp/(a;) > (4) 



exp, (x) 



,,, , Vl + K^X^ — K^X 

' T 1 — 

1 + K^X^ 



For each k 7^ 0, y = (exp, (a;))'' and x are related by the 
polynomial equation 



y — 2Kxy —1 = 



(5) 



Therefore, the graph of (exp,)'' is the upper branch of a 
hyperbola: 

- = i{y-l)^ .>o. (6) 

For each given k, the function exp, maps M onto M>, 
it is strictly increasing and it is strictly convex. If k 7^ 0, 
its inverse function is 



inK(y) = — — — , y>o, 



with derivative 



2k 



y'" + y "1 



2 y 



(7) 



(8) 



and it is called K-deformed logarithm. The function In, 
maps M> unto R, is strictly increasing and is strictly con- 
cave. Both deformed exponential and logarithm functions 
exp, and In, reduce to the ordinary exp and In functions 
when K — > 0. Moreover, the algebraic properties of the ex- 
ponential and logarithmic function are partially preserved, 
because 

exp, (x)exp, (-a;) = 1, In, (y) -f In, (y'^) = 0. (9) 
Kaniadakis [3] defines, among others, two commutative 

K 

group operations (M, 0) and (K>,(8)) in such a way that 
exp, is a group isomorphism from (M, +) onto (R>,(g)) 

K 

K 

and also from (M, ®) onto (R>, x): 

exp, (^xi © = exp, (a;i) exp, (2:2) , (10) 
exp, {xi + X2) = exp, (xi) exp, (^2) , (11) 

and, equivalently, 

Ihk ^yi «> y2^ = hr, (yi) + In, (y2) , (12) 

Iiik (yiy2) = (yi) © In^ (y2) ■ (13) 

Therefore, the binary operations © and <^ are defined by 

K, 

K 

Xi®X2^ InK (exp, (a;i) exp, (2:2)) (14) 
yi®V2= exp, (In, (yi) + In, (ya)) (15) 

Because of ^ , the deformed operation have the same in- 
verse that the usual operations: 



a;i®(-x2) = 0, yi®y2 



1. 



(16) 



The operation ® is defined on positive real numbers. How- 
ever, ()15p can be extended by continuity to non-negative 
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real numbers: 



0«)y2 = yi«)0 = 0(g)0 = 



We want to derive defining relations for the K-deformed 
operations in polynomial form. This is obtained by re- 
peated use of (IH) followed by algebraic elimination of the 
unwanted indeterminate. Symbolic computations have been 

done with [5]. First, we want to find x — xi® X2, i.e. such 
that exp^ (x) = exp^ (xi) exp^ {x2)- Let yi = (exp^ i^i))'^ 
and 2/2 = (exp^ (2:2))''. As 

(exp^ (a;))" = (exp^ (xi) exp^ (xa))" = yiy2, (18) 

Equation ([5]) gives a system of three quadratic equations 
in the indeterminates xi, X2, yi, 2/2, a^, Algebraic elimi- 
nation of yi , 1/2 gives the polynomial equation 

x"* - 2 {2k^xIxI + xl+ xl) x^ + {xl -xlf = 0, (19) 
whose solution gives 



Xl e X2 = Xiy^l K^xl +X2\Jl + K'^xj. (20) 

We will use later on the derivation formula 
d{xi © X2) 



v/1 + n'^xj 



dxi 



Second, we want to find z = yyi (g) yaj • As before, 

2/1 = (exp„ {xi)Y, y2 = (exp„ (2^2))", while 

z=(exp,(xl+X2))^ (22) 

Equation ([5]) gives three quadratic equations in the inde- 
terminates xi,X2,yi,y2, z, K. Elimination of xi,X2 gives 
the polynomial equation 

yiy2z^ + (1 - yiy2){yi + y2)z - yiy2 = (23) 

It is remarkable that this equation does not depend on n. 
An explicit solution is obtained by solving the quadratic 
equation. A possibly more suggestive solution is obtained 
as follows. First, we reduce to the monic equation 



1 



yiy2 



{yi+y2)z-l - 



and denote the two solutions as 2: > and — l/z. There- 
fore, 

z -- = (yi - -\ + (y2 - -\ (25) 
z \ vij \ y2j 

The K-logarithm defined in ([7]) is reminiscent of a fam- 
ily of transformation well known in Applied Statistics un- 
der the name of Box-Cox transformation [6] or power 
transform. For data vector yi ,...,?/„ , with yi > for all 
i, the power transform is: 

(A) y^~l 



The parameter A is to be estimated in order to get the 
best fit to the Normal distribution of the transformed data 

(17) 

vector. When compared with the power transform, the k- 
deformed logarithm x = In^ (y) has the extra feature of 
the symmetry induced by the term —y~'^ and it would be 
interesting to study it from the point of view of transfor- 
mations to normality. We are not further discussing this 
issue here. 

J. Naudts [7[|8l[9lfT0] has presented a general discus- 
sion of a class of deformed exponentials that contains In^ 
and exp^, with applications in Information Theory and 
Statistical Physics, based on a notion of generalised en- 
tropy. We are not discussing generalised entropies in this 
paper. The purpose of the present paper is to extend to k- 
deformations the non-parametric and geometric approach 
to statistical manifolds as it was developed by the Author 
and co-workers in [TTlfT^lfTBlll4l[T5] . Such an approach is 
designed to present in a non-parametric and fully geomet- 
ric way work by S-i. Amari on Information Geometry, see 
[16] and the joint monograph with H. Nagaoka [17]. Ac- 
cording to those references, the the power transform is ap- 
plied to probability densities and re-named a-embedding, 
a = 1 - 2/A. 

To explain the idea of a-embedding, we consider a gen- 
eral setting. Let f2 be any set, !F a cr-algebra of subsets, 
fi a reference probability measure, e.g. the uniform distri- 
(21) bution. A density p of the measure space (f2,JF, /i) is a 
non- negative random variable such that J pd^j, ~ 1. We 
write the expected value of a random variable U with re- 
spect to the probability measure p • /i as Ep [U] = J Up dfj,. 
For a > 1, the mapping p t-^ p^/" maps p into the unit ball 
of the Lebesgue space L°-{f2, !F, /i). Other parametrization 
of the same setting are: A = 1/a (Box-Cox), k = 1/a (Ka- 
niadakis), a = {a — 2) /a (Amari). The basic idea is to 
pull-back the structure of the unit ball of L"' to construct 
a Banach manifold on the set of densities A4>. The actual 
construction is not straightforward in the infinite dimen- 
sional case, because the image set of the a-embedding has 
empty interior in the Lebesgue space because of the non- 
negativity constraint p^^" > 0. However, the approach 
works perfectly well in the cases of either a parametric 
model or a general model over a finite state space. 

Variants of the a-embedding has been studied by [T^] , 
[l8] , [19] . There is also literature covering the non-commutative 
(quantum) case, which is not treated here, e.g. [20], 1211 . 
(24) ^. 

The present paper is aimed to discuss what happens if 
we use In^ instead of the a-embedding in the construction 
of the statistical manifold. The related issue of the algebro- 
statistical aspects of finite state space models is also dis- 
cussed. See the books p3 ] [24 ] [25] on Algebraic Statistics 
and 1^ on the relations between Algebraic Statistics and 
Information Geometry. 

Section 2 is an introduction to both the algebraic and 
the geometric features of the K-deformed Gibbs model. 
Section 3 introduces a system of charts based on the re- 
exponential; the resulting differentiable manifold is mod- 
eled on the space of centered and l/re-integrable random 
(26) variables. Section 4 discusses some properties of the tan- 
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gent bundle; only a few basic results are presented. Section 
5 is a short discussion on how the present construction re- 
lates to our formalism of exponential statistical manifolds 
and to other Authors' work. A full account the totality 
of the existing literature on Information Geometry is out- 
side the scope of the present paper and will be pubhshed 
elsewhere. 



2 K-Deformed Gibbs model 

On a finite state space 12, equipped with the energy func- 
tion U : ^ R> , we want to discuss the K-deformation of 
the standard Gibbs model. There are two options, as there 
are two different presentation of the normalizing constant. 

The first option would be to consider the statistical 
model 



therefore 



exp, {BU{x)) 

m 

exp^ ( e\J{x) ® In^ 



(27) 



m 



or 



with i,^{d)=\^^Z{B), 
lnKp(a;; B) = 



BU{x)^l + k2(^^(0))2 _ ^40)^1 + n^B^U{xf (28) 

To the best of our knowledge, this model is not considered 
in the literature, therefore we are not discussing it here. 
The second option is to define the generalised model 

as 

p{x-B)^e^vAeU{x)-i;^{B)) (29) 
= exp„ {BU{x)) ® exp^ (-V'«(6')) , 



where iI^k^O) is the unique solution of the equation 
J^exp, {BU{x)-i>M) = ^- 



(30) 



This is the model that has been considered in [3], on the 
basis of the classical maximum entropy argument. It is a 
particular case of more general models as discussed by '27] 
and by ^9,10 and . We focus here on the algebraic and 
geometrical features of the model, not on its derivation 
from general principles. 

The one-parameter statistical models ((27|) and (l29l) are 
different unless k = 0. This fact marks an important dif- 
ference between the theory of ordinary exponential mod- 
els and K-deformed exponential models. For example, the 
derivative with respect to B of the left hand side of (I5D1) is 



U{x)~i;'M 



E-, : 

xen ^Jl + K^ {BU{x)~ij^{B)y 



:exp„ iBUix)-M0)), 
(31) 



U-tp'iB) 



1 + {BU-4>n{B)y 



0, 



(32) 



where [F] = J2x^i^)pi^'^^)- If '« = 0, we have the 
usual formula ip'if^) =^e[U]. 

From the geometrical point of view, the second ap- 
proach has the advantage that the model for the In^- 
probabilities is linear. Here, by geometry we mean differ- 
ential geometry of statistical models, i.e. the construction 
of an atlas of charts representing the subset of probability 
densities unto open sets of a Banach space, see [28]. Before 
moving into this approach, we discuss the duality of the 
Gibbs model and its algebraic features. 

Let V — Span (1, U) and V-^ the orthogonal space, i.e. 
V g V-^ if, and only if, J2x ^(^) ~ ^ ^"^^ X^x v{x)U{x) — 0. 
It follows from Equation (|29]) that 



^ w(x) In, (p(a;; ^^)) - 0, v e (33) 

Vice versa, if a strictly positive probability density func- 
tion p is such that ln„ p is orthogonal to V-^ , then p be- 
longs to the /c-Gibbs model for some B. 

For each v G , we can take its positive part and 
its negative part v~ , so that v — — and v^v^ = 0. 
Equation (|33p can be rewritten as 



X : v{x)>0 



v^{x) In, {p{x)) 



X : v{x)<0 



V (x)ln, (p(x)) 



(34) 

The interpretation of (|34p is the following. As J^x '^(^) ~ 
0, we have 



^^(^) = J2^ (x) ^X. 



(35) 



xefi 



xen 



It follows that ri — w^/A, and r2 — /X are probability 
densities (states) with disjoint support, so that (|34]) can 
be restated by saying that a positive density p belongs to 
the K- Gibbs model if, and only if. 



E,, K (P)] = Er. [ln« (p)] 



(36) 



for each couple of densities ri, r2 such that rir2 — and 
Eri [U] = Fir2 [U], where E^ [V] denotes the mean value of 
V with respect to r, i.e. J^x V{x)r{x). 

li V G V'^ happens to be integer valued, using the k- 
algebra and the notation 




we can write (jMJ) as 



p{x) 



(x) 



(37) 



, .0V (x) 

p{x)- , (38) 



X : i!(a:)>0 



X : i'(2;)<0 
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It should be noted that ([38| is continuous function of 
p{x) , X € n and it does not require the strict positivity of 
eachp(x), x £ i7. Therefore, the same equation is satisfied 
by all limits (if any) of the model ([29|l . Moreover, (|38p is a 
K-polynomial invariant for the K-Gibbs model, cf. in [15j 
the discussion of the case k = 0. Vice versa, each positive 
probability satisfying ((38|) belongs to the K-Gibbs model. 

The set of all K-polynomial equations ([55)1 is not a fi- 
nite set, because each equation depends on the choice of 
an integer valued vector v in the orthogonal space V^. 
Accurate discussion of this issue requires tools from Com- 
mutative Algebra. If the energy function U takes its values 
on a lattice 0, A, 2A, . . . , Z\ > 0, we can choose integer val- 
ued random variables wi, . . . , vn^2 to be a linear basis of 
the orthogonal space V-^. In such a case, we have a finite 
system of binomial equations 



j(a:)>0 



X : v(x)<0 



p{x) 



tgiv- [x) 



(39) 



1, . . . , iV — 2, which is equivalent to the original model 
. However, it is not generally true that the finite sys- 
tem (|39|) is equivalent to the infinite system p8|) . 

In the classical case k = 0, the polynomial invariants 
of the Gibbs model form a polynomial ideal I in the poly- 
nomial ring Q[p(a:): x € Q]. The ideal I admits, because 
of the Hilbert Theorem, a finite generating set. The dis- 
cussion of various canonical forms of such a generating set 
is one of the issues of Algebraic Statistics. 



Example We specialize our discussion to the toy example 
which is discussed in [15] for k = 0. Let Q = {1, 2, 3, 4, 5}, 
U(l) = U(2) = 0, t/(3) = 1, C/(4) = C/(5) = 2. The 
following display shows a set of integer valued Vj, j = 
1, 2, 3 of the orthogonal space V-^. 



1 U VI 



V2 V3 



1 

-1 











1 

-1 



1 
1 

-4 
1 
1 



(40) 



Equation ([55)1 becomes: 



>(l)=p(2) 
p(4) = p(5) 

p(l) p{2) p(4) (g) p(5) = p(3) » " 



(41) 



A strictly positive probability density belongs to the k- 
Gibbs model ([^Hl) if and only if it satisfies ([JT]) . The set of 
all polynomial invariants of the Gibbs model is a polyno- 
mial ideal and the question of finding a set of generators is 
intricate. A non strictly positive density that is a solution 
of gl]) is either p(l) = p{2) = p{3) = 0, p(4) = p{5) = 
1/2, or p{l) = p{2) = 1/2, p{3) = p(4) = p{5) = 0. These 
two solutions are the uniform distributions on the sets of 
values that respectively maximize or minimize the energy 
function. 



From ([^5]) we can derive another form of the last equa- 
tion in the system (|4ip : 



p«(4) - 



+ ^^^(5) - 



^"(5) 



4 p«(3)- 



p«(3) 



which is another algebraic form of the K-Gibbs model. 

Jet another algebraic presentation is available. In the 
model p9|) . we introduce the new parameters 



Co =exp^ (-V'k(6')): 
Ci = exp„ {6) , 



(42) 
(43) 



so that 



p{x; 0) = exp, {0U{x)) ® exp, (-MO)) , (44) 



Co ® cr 



<»U(x) 



(45) 



The probabilities are K-monomials in the parameters Co i Ci i 

>(l)=p(2) = Co 

p(3) = Co ® Ci 

K (46) 

p{4) = p{5) = Co Ci 

K 

In algebraic terms, such a model is called a toric model. 
It is interesting to note that in (|46[) the parameter C,q 
is required to be strictly positive, while the parameter 
Ci could be zero, giving rise the uniform distribution on 
{1,2} = {x: U{x) = 0}. The other limit solution is not 
obtained by ([46| . The algebraic elimination of the inde- 
terminates Coj Ci in (BHj) will produce back polynomial in- 

(g)2 

variants. For example, from {C,o®C,i)^ — (Co)'8i(Co'X'Ci" ); 

® 2 

we get ^(3)" —p{2)®p{b). 

The monomial parametric representation of the type 
()46|) . together with their ability to parametrize the limit 
cases is discussed in [IS]. The notion of toric model and 
the related discussion could be applied to the model ([27[l 
which we are not discussing here. 



3 Charts 

We use the coordinate-free formalism of differential geom- 
etry of [28]. Let us fix a K e]0, 1[. In order to construct 
an atlas of charts, i.e. mappings from the set of strictly 
positive probability densities to some model vector 
space, we consider each chart as associated to a density 
p G ^Ay. Such a p is used as a reference for every other 
density g of a suitable subset of 7W>, via the statistical 
notion of likelihood q/p. The theory is not restricted to a 
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finite sample space. However, we do not discuss here the 
technicahties involved in the non finite case, but we use a 
general notation. 

We first define a K-divergence. li q,p E A^> satisfy the 
condition 



-q) 



(47) 



then the K-divergence is defined to be 

D,ip\\q)^'Ep 



In. ( P 



1 

2k' 



(48) 



The first of the two conditions in (|¥7)l is always satisfied 
because 



E„ 



< E 



= 1. 



(49) 



The second condition is non-trivial unless the state space 
is finite, because we are assuming 

■ < +00. (50) 



When such a condition is not satisfied the value of the 
expectation in (|48|) is +00. We are not interested in this 
case here. 

The strict convexity of — In^ implies 



D^{p\\q)^Ep 



ln„ 



q 
p 

In.- ( E, 



> 



In. (1)^0. (51) 



with equality if, and only ii q — p. 

The manifold we want to define has to be modelled 
on the Lebesgue space of centered (l/K)-p-integrable ran- 



dom variables Lq^"'(j)), i.e. v S L^'^^ip) if, and only if, 
Ep [kl^^''] = / \v\^/'^pdp, < +00 and Ep [v] = J vpdfi ^ 
0. At each p there is a different model space, so that an iso- 
morphism between them has to be provided. The simplest 
isometric identification between L^/'^{pi) and L^^'^{p2) is 



1/k, 



L'/^ipi)3u^ (^) ueL'/^ip2) 



(52) 



In fact, 



1/k 



P2d^i^ I (53) 



We use a variation of the formalism used in the ex- 
ponential case [TT]. The new manifold will be called k- 
statistical manifold. We define the subset £p of > by 



f 



q e My 

q G M> 

q e M> 



q 
p 

q p 

p' q 
p 
q 



G L\p) 

e L\p) 



(54) 



The assumption in is stronger than that in ([5(I)l . be- 
cause here we want Ep[p/g] = Jp^/qd^j, < +00, and 
Fip[p^/q'^] < (Ep[p/q])'^. Indeed, we are assuming more 
than the mere existence of DK{p\\q), unless the state space 
is finite. It should be noticed that the set £p does not de- 
pend on K g]0, 1[. The condition J p'^/qdfi < +00 can be 
interpreted with the aid of the isometry (|52p . As 



eLHp) 



(55) 



by definition, we can map it to L « (g) by multiplying it by 
itself, to get 



(56) 



Each of the sets £p, p G Aiy, is going to be the domain of 
a chart, and this will define an atlas of charts because of 
the covering My = iipSp] a connected component of the 
manifold will be the union of overlapping £'s. 

li q (z £p, then q is almost surely positive and we can 
write it in the form q = exp^ (w) p, where 

v^luj^)^ ; e L^'-{p) (57) 

\PJ 2k 

The expected value at p of w = In, 



E„ 



In. 



= -D^{p\\q), 



so that we can write every g S as 

q = exp. {u - Di^{p\\q))p, 



(58) 



(59) 



where w is a uniquely defined element of the set of centered 
(l/K)-p-integrable random variables L^i^'^^p), namely 



= In. 
= In. 



E„ 



In. 



D4p\\q)- 



Conversely, given any u G ^^''(p), the real function 
■0 I— > Ep [exp^ {u — is continuous and strictly decreas- 
ing from +00 to 0, therefore there exists a unique value of 
ip, say tpK.p{u), such that 



(60) 



Ep [exp^ (u - ip^,p{u))] = 1, 

so that 

q = exp^ (u - ?/'.,p(u))p e £p C M>. 
The 1-to-l mapping 

£p B q ^ u e Ly'^{p) 



(61) 
(62) 

(63) 
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is our chart. By comparing (|62p with (|59p we obtain ipf^^p{u) 
Z)k(p||'?)7 where u is the image of q in the chart at p. I.e. 
the p-chart representation of the functional q i— > Di^{p\\q) 
is u I— > ipK.piu). A second example is the representation of 
q 1-^ D,,{q\\p): 



D^{q\\p) - E, 



In. ( ^ 

P 



(64) 



The second derivative of u i— s- exp^ (u — ipK,p{u)) in the 
directions v and w is the first derivative in the direction 
w of u 1-^ exp.'(u — 'ipK,p{u)){v — Dil:K,p{u)v), therefore it 
is equal to 

exp^"(u - ■i/'K,p(w))(w - D-tpi^^p{u)v){w - Dtljf,^p{u)w) 

- exp^'(u - ip^^p{u))D'^^p^^p{u)vw. (73) 



The functional V'k.p is of key importance in the case 
K = 0, where it is the cumulant functional of the random 
variable u: 

V'p(m) = lnEp[e"] (65) 

The proof of differentiability properties, in the non- 
finite case with k = is not trivial, see e.g. [H], because 
the domain of ^l)p functional has a non-trivial description 
in that case. For 0, the domain of V'k,* is the full space 

Lq^'^ (j)), and we can compute the directional derivatives 



Dil},,^p{u)v = ^V'k,p(" - 



tv) 



(66) 



The random variable in ()73p has zero p-expectation, so 
that 

D^'ijj^^p{u)vw = 
Ep [exp."(M - ■0k,p(m))(w - Di)^,p{u)v){w - Dilj^^p{u)w)] 



Ep[exp^'(u--0«,p(u))] 



(74) 



If ui = u 7^ 0, then D'^ipi^,p{u)vv > 0, therefore the func- 
tional ipK.p is strictly convex. For u = we obtain 



t=o 



-DVk,p(0)ww = Covp {u,v) 



(75) 



and 



dsdt 



i^K,p{u + sv + tw) 



For K — 0, the Frechet derivatives of ipp are 

Dipp[u)v = Eg [v] , 
D'^ipp{u)vw = Cavq {v, w) , 



(67) 



(68) 
(69) 



where v and w are the directions of derivation and q = 

For K 7^ the computation of the directional derivative 
of ([6T|) gives 



= 0. (70) 



Ep [exp.' (u - i>^,piu)) {v - DV'k,p(w)w)] 
V - D%l;^,^p(u)v 



E„ 



^1 + K^{U - V'k,p(m))^ 



It follows from ^ with u = that Z)i/'«:,p(0) = 0. 

Otherwise, let g|p denote the density proportional to 

exp^'(u-'0K,p(u))p = 



1 



.^(ln.(^))' 



(71) 



see [TU], where such a density is called escort probability. 
The explicit expression for the derivative is 



D^i^^p{u)v = E, 



q\p I 



(72) 



which is the same as (j68p , but the expectation is computed 
with respect of the escort density q\p. Later we will give a 
geometric interpretation of q\p. 



We do not have a similar interpretation for m 7^ 0, but see 
the discussion of parallel transport below. 



4 The /(-statistical manifold and its tangent 
bundle 

Assume now we want to change of chart, that is we want 
to change the reference density from p to p to represent 
a q that belongs both to £p and to Sp. From now on, we 
skip the discussion of the non-finite case; such a discussion 
will be published elsewhere. In the finite state space case, 
the integrability conditions of ([54|) are always satisfied, 
so that all the chart's domains £p are equal to Al>. The 
application of (|60p and ([62|) to the change 



ueLl/'-ip) 



(76) 



gives 



= IUk 



Eg 



IUk 



In. exp^ (u - ■(/'k.p('«)) - 



Ep 



In^ exp^ {u ~ ipf,,p{u)) - 
* P 



(u - ipK.p{u)) © Iuk - 

\p 



Ep 



{u - i/'k,p(w)) ffi In. - 
' P 



For K = the change of chart is an affinc function: 



u — u + 



ln(|)-E, 



ln(^ 



(77) 



(78) 
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with linear part 

V ^ V - Ep[v] . (79) 

For K 0, the derivative in the direction v of the 
change of chart at u is obtained from the derivation for- 
mula dH]) . It has the form A-Ep [A] , with 



A = 



1 + 



K^ln„ (u-'(/'«,p(u))' 

1 + k"^ {u- il}i^^p(u)f 
x{v- Dtp,,p{u)v) . (80) 



We are then led to the study the tangent spaces of the 
K-statistical manifold. Let pg, 9 s] — 1, 1[, be a curve in 

Pe = exp^ {ue - ■0K,po("e))po- (81) 
In the chart at p the velocity vector is given by 

ue e Ll^^ip). (82) 

We identify the tangent space at po with the space of 
the random variables uq. In general, the tangent space at 

p e My is defied to be Tp — ^^'^(p). Derivation with 
respect to 9 of ([8T|) gives 



El 
Pe 



(83) 

In particular, po/po = ua- To extend this to the general 9, 
let us observe first that ([70]) . with p = po, u = ue, v — tig, 
shows that Ep^ [pe/pg] = 0. The second factor in the right 
end side of ifSS]) is, because of ([72]). 



lie - D%j};^^p{ug)(ue) = ug - Ep^|p„ [lig] . (84) 
The first factor is 



1 + In 




-1/2 



(85) 



Let us define a parallel transport U^p mapping the 

tangent space at p, i.e. Tp = Lq^^{p), on the tangent space 

at p, i.e. Tp = L^^'^ip) as follows. If u G L^^'^ip), then the 
random variable 



u - Ep|p [u] 



«^(ln.(f))' 



(86) 



belongs to Tp — Ly^{p), and we define u — Upp{u). In 
fact, Ep [u] =0, and, moreover. 



1 + In 




-1/2 



< 2 



so that the conclusion 



Eii 



1/k 



< 2i/''E„ 



|1/k 



follows from the isometry ([5^ . 

The quantity p'g /pe in (|83p has the following remark- 
able interpretation in terms of the parallel transport: 



pe ^ 



,(we), 



(89) 



i.e. it is the velocity vector at 9, transported to the tangent 
space at pg. 

In this setting, we can define K-exponential models in 
a non parametric way as 



9 = exp^(u-?AK,p(u))p, u£V, 



where F is a linear sub-space of iy"(p). Each V ^ V 
is called a canonical variable of the /t-exponential model. 
The implicit representation of the exponential model (|90p 
is 



(90) 



E„ 



ln„ 



0, we V^. 



(91) 



where C Lq ^^''(p) is the orthogonal of V . In Statis- 
tics, awe is called a constraint of the log-linear model. 
We could derive, as we did in the K-Gibbs model exam- 
ple, for lattice-valued constraint variables v, the relevant 
polynomial-type equations based of the deformed product 
operation. 

In particular, a one dimensional exponential model is 
characterised by a one-dimensional space V = Span(u), 

u e Lq^'^ {p). Let us show that such a model satisfies a 

differential equation on the manifold. Given u e Lq^'"{po), 
for each g e fp, we can define the mapping 



£p3q^U^Ju)eTg, 



(92) 



which is a vector field of the K-manifold. The velocity of a 
curve pg = exp^ {ug — ipK,pa{'Ue)) Po is represented in the 
chart at pg by pg/pg because of (|89)) . therefore we can 
consider the differential equation 



pe 



t/po,P«H> (93) 



whose solution is 



pg = exp^ {9u - ■>p^^p{9u)) Pa, 



(94) 



cf. ([83]) . Equation (|95|) implies that the one dimensional 
exponential model has tangent vectors which are trans- 
ported into each other by the parallel transport If^, i.e. 
the model is J7''-auto-parallel. 

The one dimensional exponential model is one of the 
simplest example of differential equation on the ^-statistical 
manifold. The treatment of evolution equations for densi- 
j-gy-j ties is one of the main motivation for introducing a man- 
ifold structure on the set of densities, cf. [29], [30], [3T] . 



8 



Giovanni Pistone: K-exponential models from the geometrical viewpoint 



5 Statistical manifolds 

In this final section, we go back to the general setting and 
briefly discuss how the K-statistical manifolds we have de- 
fined relate with the previous construction of the exponen- 
tial statistical manifold. (f2,JF, /i) is a generic probability 
space, ^ is the set of real random variables / such that 
/ f = 1, A4> the convex set of probability densities, 
Al> the convex set of strictly positive probability densi- 
ties: 

M>CM>CM^ (95) 

In the classical case, i.e. k = 0, differentiable man- 
ifolds are defined on both A^> and A4^. Such manifolds 
are both modeled on suitable Orlicz spaces, see [32^. Orlicz 
spaces are a generalization of Lebesgue spaces, where the 
norm is defined through a symmetric, null at zero, non- 
negative, convex, with more than linear growth, function 
called Young function. Let # be any Young function with 
growth equivalent to exp, e.g. <P{x) — cosh(x) — 1, with 
convex conjugate !f , e.g. !Z'(j/) = (1 + \y\) log (1 + \y\) - \y\. 
The relevant Orlicz spaces are denoted by and L*^, re- 
spectively. Both these Banach spaces appear naturally in 
Statistics. A random variable u belongs to the space L* 
if, an only if, its Laplace transform exists in a neighbor- 
hood of 0, that is, the one dimensional exponential model 
p{9) oc e^" is defined for values of the parameter in a neigh- 
borhood of 0. A density function / has finite entropy if, 
and only if, it belongs to the space L*. We denote by 
Lf , L'^ the sub-spaces of centered random variables. If 
the sample space is not finite, then the exponential Orlicz 
space is not separable and the closure of the space of 
bounded functions is different from . There is a natural 
separating duality between Lg and Lq , which is given by 
the bi-linear form 

{u,v) i—> J uv dfi. (96) 

It can be proved that the cumulant generating functional 
ipp{u) = Ep [e"], u G Lq (p), p G My is positive, strictly 
convex, analytic. The interior of the proper domain of tjjp, 

Sp = {ue L^(p) : ijp{u) < +c3o}° (97) 
defines the so called maximal exponential model 

fo,p- {e"-'^-("^p:ue5p} (98) 

In our case, i.e. < k < 1, we could consider 

cosh„ (x) - 1 ^(exp^ (x) + exp^ (-x)) - 1 

1 / fj; dt r-x dt \ 

= ± / e v^i+^ -I- e j - 1 

= cosh( / ^ ) - 1. (99) 

We obtain a Young function equivalent to Ixj^/", so that 
the related Orlicz space is just L^^'^ and the K-statistical 



manifolds appears as a generalisation of the exponential 
construction. 

If g e ^Q.p, then q — exp {u — ipp{u)) .p. If —u G Sp, 
then q G £p, so that q — exp^ {v — ipk,p{v)) for a suitable 

V G Lq^'^{p). It follows that 

exp (w - -ippiu)) = exp^ {v - tpk.piv)) , (100) 

therefore the e-coordinate u and K-coordinate v are related 
by the equation 

6 Conclusion 

We have presented a non parametric construction of the 
statistical manifold based on the use of the centered In^- 
likelihood as a functional coordinate. The basics of the 
formalism are discussed in Section 3 and 4. A notable dif- 
ference from the standard case k = is the absence of 
simple formulae for the functional and its derivatives. 
It has been shown that the derivation of such quantities 
is related with a suitably defined parallel transport of the 
tangent bundle; for such a parallel transport the exponen- 
tial models are auto-parallel. This, in turn, should lead 
to a theory of evolution equations on the K-manifold. The 
algebraic features of the K-exponential models for lattice 
contrasts have been described on an example. The relation 
of this construction to a-geometries of [17] is not discussed 
here nor is the relation with other geometric construction 
based on different divergences, e.g. Tsalhs entropy [33] . 
When comparing our formalism with similar construction 
of the statistical manifold based on other transformations 
derived from the power transform, it should be noticed 
that the K-logarithm has a distinctive advantage of hav- 
ing range M while retaining a simple algebraic character. 

This piece of research was started in occasion of the SigmaPhi 
2008 conference, Kolympari, GR. The Author thanks the or- 
ganizers of the symposium Fisher Information and Geometry 
for the invitation to present a paper there and to R. Trasarti- 
Battistoni for the conversations during the conference. A pre- 
liminary version of this paper has been discussed with G. Ka- 
niadakis and was presented at MSRI Berkeley CA. The work 
has been supported by DIMAT, Politecnico di Torino, by MSRI 
Berkeley CA, and by SAMSI, Research Triangle Park NC. 
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